; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039848 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039848
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionF-box protein At1g67340
Genome locationchr13:372845..376925
RNA-Seq ExpressionLag0039848
SyntenyLag0039848
Gene Ontology termsGO:0006414 - translational elongation (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0003746 - translation elongation factor activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR002893 - Zinc finger, MYND-type
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR036047 - F-box-like domain superfamily
IPR044508 - F-box protein At5g50450/At1g67340-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601598.1 F-box protein, partial [Cucurbita argyrosperma subsp. sororia]2.9e-19487.92Show/hide
Query:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHD---AAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLN
        MRTRRGLCYP  +LQ+ F SD R+RKRTH+   AAADR FCRK+NK +PDI  P SDLFDSLPDDLVISILSKLSS AS P++FINILLTCKRLN LGLN
Subjt:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHD---AAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLN

Query:  PIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCA
        P+VLSRAS K FAI ARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAI SHAPALYSLAVIQFNGSGGSKNDK+LRAGVALCA
Subjt:  PIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCA

Query:  RAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQ
        RAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSE+ SSV AS +CLTWN   HPHHRHVTGS CPLLSDFGCNIPAPE HPASQ
Subjt:  RAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQ

Query:  FLAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES
        FLAEWF ARGGSPG GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDC PVERWLDDNGDGGA+G DD+M ES
Subjt:  FLAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES

KAG7034708.1 F-box protein [Cucurbita argyrosperma subsp. argyrosperma]2.9e-19487.92Show/hide
Query:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHD---AAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLN
        MRTRRGLCYP  +LQ+ F SD R+RKRTH+   AAADR FCRK+NK +PDI  P SDLFDSLPDDLVISILSKLSS AS P++FINILLTCKRLN LGLN
Subjt:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHD---AAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLN

Query:  PIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCA
        P+VLSRAS K FAI ARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAI SHAPALYSLAVIQFNGSGGSKNDK+LRAGVALCA
Subjt:  PIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCA

Query:  RAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQ
        RAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSE+ SSV AS +CLTWN   HPHHRHVTGS CPLLSDFGCNIPAPE HPASQ
Subjt:  RAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQ

Query:  FLAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES
        FLAEWF ARGGSPG GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDC PVERWLDDNGDGGA+G DD+M ES
Subjt:  FLAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES

XP_022957417.1 F-box protein At1g67340 [Cucurbita moschata]8.4e-19488.14Show/hide
Query:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAA--ADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNP
        MRTRRGLCYP  +LQ  F SD R+RKRTHDAA  ADR FCRK+NK +PDI  P SDLFDSLPDDLVISILSKLSS AS P++FINILLTCKRLN LGLNP
Subjt:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAA--ADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNP

Query:  IVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCAR
        +VLSRAS K FAI ARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAI SHAPALYSLAVIQFNGSGGSKNDK+L AGVALCAR
Subjt:  IVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCAR

Query:  AAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQF
        AAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSE+ SSV AS +CLTWN   HPHHRHVTGS CPLLSDFGCNIPAPE HPASQF
Subjt:  AAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQF

Query:  LAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES
        LAEWF ARGGSPG GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDC PVERWLDDNGDGGA+G DD+M ES
Subjt:  LAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES

XP_022998094.1 F-box protein At1g67340 [Cucurbita maxima]1.2e-19287.56Show/hide
Query:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV
        MRTRRGLCYP  +L   F SD R+RKRTHDA+ADR FCRK+NKL+PDI  P SDLFDSLPDDLVISILSKL + AS  ++FINILLTCKRLN LGLNPIV
Subjt:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV

Query:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA
        LSRAS K FAI ARNWTESAHRFLKQC+DAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDK LRAGVALCARAA
Subjt:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA

Query:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA
        FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSE+ SSV AS + LTWN   HPHHRHVTGS CPLLSDFGCNIPAPE HPASQFLA
Subjt:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA

Query:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES
        EWF ARGGSPG GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDC PVERWLDDNGDGGA+G  D+M ES
Subjt:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES

XP_038893304.1 F-box protein At1g67340 [Benincasa hispida]5.9e-19588.6Show/hide
Query:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV
        MRTR GL YP +Q   T  SD   RKRTH  AADR FCRK+NKL+ D   P +DLFDSLPDDL+ISILS LSS AS P+DFINILLTCKRLN LGLNP+V
Subjt:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV

Query:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA
        LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA
Subjt:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA

Query:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA
        FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSE+ASSV AS SCLTW+ H  PHHRH+TGSGCPLLSDFGCNIPAPE HPASQFLA
Subjt:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA

Query:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES
        EWF ARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLD+NGDGGA+GAD IM ES
Subjt:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES

TrEMBL top hitse value%identityAlignment
A0A0A0KTC5 MYND-type domain-containing protein2.1e-19087.82Show/hide
Query:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV
        MRTR GL YP +Q  T F S    RKR H  AADR FCRK+NKL+  I  P SDLFDSLPDDLVI+ILS LSS AS P+DFINILLTCKRLN LGLNP+V
Subjt:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV

Query:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA
        LSRASQKTFAIRA+NWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA
Subjt:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA

Query:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA
        FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSE+ASSV AS SCLTWN+   PHHRHVTGSGCPLLSDFGCNIPAPE HPASQFLA
Subjt:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA

Query:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES
        EWF ARGGSPG+GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHK+DCAPVERWLDDNGDG  + ADDIM ES
Subjt:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES

A0A1S3BE37 F-box protein At1g673402.3e-18987.56Show/hide
Query:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV
        MRTR GL YP +Q  TTF S    RKR    AADR FCRK+NK + DI  P SDLFDSLPDDLVI+ILS   S AS P+DFINILLTCKRLN LGLNP+V
Subjt:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV

Query:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA
        LSRASQKTFAIRA+NWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA
Subjt:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA

Query:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA
        FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSE+ASSV AS SCLTWN+H  PHHRHVTGSGCPLLSDFGCNIPAPE HPASQFLA
Subjt:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA

Query:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES
        EWF ARGGSPG+GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGD   + ADDIM ES
Subjt:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES

A0A5A7SUQ1 F-box protein8.0e-19087.82Show/hide
Query:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV
        MRTR GL YP +Q  TTF S    RKR    AADR FCRK+NK + DI  P SDLFDSLPDDLVI+ILS L S AS P+DFINILLTCKRLN LGLNP+V
Subjt:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV

Query:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA
        LSRASQKTFAIRA+NWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA
Subjt:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA

Query:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA
        FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSE+ASSV AS SCLTWN+H  PHHRHVTGSGCPLLSDFGCNIPAPE HPASQFLA
Subjt:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA

Query:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES
        EWF ARGGSPG+GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGD   + ADDIM ES
Subjt:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES

A0A6J1GZ25 F-box protein At1g673404.1e-19488.14Show/hide
Query:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAA--ADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNP
        MRTRRGLCYP  +LQ  F SD R+RKRTHDAA  ADR FCRK+NK +PDI  P SDLFDSLPDDLVISILSKLSS AS P++FINILLTCKRLN LGLNP
Subjt:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAA--ADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNP

Query:  IVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCAR
        +VLSRAS K FAI ARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAI SHAPALYSLAVIQFNGSGGSKNDK+L AGVALCAR
Subjt:  IVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCAR

Query:  AAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQF
        AAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSE+ SSV AS +CLTWN   HPHHRHVTGS CPLLSDFGCNIPAPE HPASQF
Subjt:  AAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQF

Query:  LAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES
        LAEWF ARGGSPG GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDC PVERWLDDNGDGGA+G DD+M ES
Subjt:  LAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES

A0A6J1KDE5 F-box protein At1g673405.9e-19387.56Show/hide
Query:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV
        MRTRRGLCYP  +L   F SD R+RKRTHDA+ADR FCRK+NKL+PDI  P SDLFDSLPDDLVISILSKL + AS  ++FINILLTCKRLN LGLNPIV
Subjt:  MRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIV

Query:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA
        LSRAS K FAI ARNWTESAHRFLKQC+DAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDK LRAGVALCARAA
Subjt:  LSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAA

Query:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA
        FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSE+ SSV AS + LTWN   HPHHRHVTGS CPLLSDFGCNIPAPE HPASQFLA
Subjt:  FLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLA

Query:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES
        EWF ARGGSPG GLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDC PVERWLDDNGDGGA+G  D+M ES
Subjt:  EWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES

SwissProt top hitse value%identityAlignment
Q2YDC9 Programmed cell death protein 29.0e-0531.82Show/hide
Query:  PLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVE
        P  +DF    P  E  P+    + +   + G+      LC   GC  P       +RCS C   +YCS+  Q+LDW+L HK  CA  +
Subjt:  PLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVE

Q9FK27 F-box protein At5g504506.9e-10656.32Show/hide
Query:  KKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTL
        KK +L  +     ++ F+ L DDL+ISIL KL++ AS P+DF+ +L TCKRLN LGL+P+VLS+A  +T A+ A  W++S+H+FLK C +AGN++A Y+L
Subjt:  KKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTL

Query:  GMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARE
        GMIRFYCLQN  SGASLMAKAAI SHAPALYSL+VIQFNGSGGSK DK+LRAGVALCAR+A+LGH+DALRELGHCLQDGYGV ++++EGRR L+QANARE
Subjt:  GMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARE

Query:  LAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCG
        LA  L S  +                       +G     L+D    +P  E HP ++FL EWF +       GLR+CSH GCGRPETR HEFRRCSVCG
Subjt:  LAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCG

Query:  AVNYCSRACQALDWKLRHKMDCAPVERW------LDDNGDGGAEGADD
         VNYCSR CQALDW+ +HK++C P++ W      + D+G+  A   DD
Subjt:  AVNYCSRACQALDWKLRHKMDCAPVERW------LDDNGDGGAEGADD

Q9FPS9 Ubiquitin carboxyl-terminal hydrolase 157.6e-0426.85Show/hide
Query:  GVRQNITEGR---RFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRL
        G   NI+E R     L Q  A E     +  +A   V   S  T N       + V+  G  + ++F            S  +    G    +P +   L
Subjt:  GVRQNITEGR---RFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRL

Query:  CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVE
             C  P        RCS C +V YCS  CQ + W++ HK +C PVE
Subjt:  CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVE

Q9FYF9 F-box protein At1g673405.4e-14376.76Show/hide
Query:  SDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGS
        +DL DS+PDDLVISIL KL S + CPADFIN+LLTCKRL  L +NPIVLSR S K  A++A NW+E +HRFLK+C DAG++EACYTLGMIRFYCLQNRG+
Subjt:  SDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGS

Query:  GASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSV
        GASLMAKAAI SHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGH+DALRELGHCLQDGYGV QN++EGRRFLVQANARELAAVLSS   +  
Subjt:  GASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSV

Query:  VASGSCLTWNSHAHPHHRHVTGSG---CPLLSDFGCNIPAPETHPASQFLAEWFGARGGS-PGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRAC
               TW S + P    V   G   CPLLSDFGCN+PAPETHPA++FLA+WF  RGG  PG GLRLCSH GCGRPETR+HEFRRCSVCG VNYCSRAC
Subjt:  VASGSCLTWNSHAHPHHRHVTGSG---CPLLSDFGCNIPAPETHPASQFLAEWFGARGGS-PGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRAC

Query:  QALDWKLRHKMDCAPVERWLDDNGDGG
        QALDWKLRHKMDCAPV+RWL++ GDGG
Subjt:  QALDWKLRHKMDCAPVERWLDDNGDGG

Arabidopsis top hitse value%identityAlignment
AT1G17110.1 ubiquitin-specific protease 155.4e-0526.85Show/hide
Query:  GVRQNITEGR---RFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRL
        G   NI+E R     L Q  A E     +  +A   V   S  T N       + V+  G  + ++F            S  +    G    +P +   L
Subjt:  GVRQNITEGR---RFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRL

Query:  CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVE
             C  P        RCS C +V YCS  CQ + W++ HK +C PVE
Subjt:  CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVE

AT1G17110.2 ubiquitin-specific protease 155.4e-0526.85Show/hide
Query:  GVRQNITEGR---RFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRL
        G   NI+E R     L Q  A E     +  +A   V   S  T N       + V+  G  + ++F            S  +    G    +P +   L
Subjt:  GVRQNITEGR---RFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRL

Query:  CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVE
             C  P        RCS C +V YCS  CQ + W++ HK +C PVE
Subjt:  CSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVE

AT1G67340.1 HCP-like superfamily protein with MYND-type zinc finger3.8e-14476.76Show/hide
Query:  SDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGS
        +DL DS+PDDLVISIL KL S + CPADFIN+LLTCKRL  L +NPIVLSR S K  A++A NW+E +HRFLK+C DAG++EACYTLGMIRFYCLQNRG+
Subjt:  SDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGS

Query:  GASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSV
        GASLMAKAAI SHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGH+DALRELGHCLQDGYGV QN++EGRRFLVQANARELAAVLSS   +  
Subjt:  GASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSV

Query:  VASGSCLTWNSHAHPHHRHVTGSG---CPLLSDFGCNIPAPETHPASQFLAEWFGARGGS-PGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRAC
               TW S + P    V   G   CPLLSDFGCN+PAPETHPA++FLA+WF  RGG  PG GLRLCSH GCGRPETR+HEFRRCSVCG VNYCSRAC
Subjt:  VASGSCLTWNSHAHPHHRHVTGSG---CPLLSDFGCNIPAPETHPASQFLAEWFGARGGS-PGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRAC

Query:  QALDWKLRHKMDCAPVERWLDDNGDGG
        QALDWKLRHKMDCAPV+RWL++ GDGG
Subjt:  QALDWKLRHKMDCAPVERWLDDNGDGG

AT2G24640.1 ubiquitin-specific protease 191.2e-0447.5Show/hide
Query:  CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDC
        CG+  T     ++CS C +V YCS ACQ  DWK  HK+ C
Subjt:  CGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDC

AT5G50450.1 HCP-like superfamily protein with MYND-type zinc finger4.9e-10756.32Show/hide
Query:  KKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTL
        KK +L  +     ++ F+ L DDL+ISIL KL++ AS P+DF+ +L TCKRLN LGL+P+VLS+A  +T A+ A  W++S+H+FLK C +AGN++A Y+L
Subjt:  KKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCKRLNCLGLNPIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTL

Query:  GMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARE
        GMIRFYCLQN  SGASLMAKAAI SHAPALYSL+VIQFNGSGGSK DK+LRAGVALCAR+A+LGH+DALRELGHCLQDGYGV ++++EGRR L+QANARE
Subjt:  GMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARAAFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARE

Query:  LAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCG
        LA  L S  +                       +G     L+D    +P  E HP ++FL EWF +       GLR+CSH GCGRPETR HEFRRCSVCG
Subjt:  LAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGSPGHGLRLCSHVGCGRPETRRHEFRRCSVCG

Query:  AVNYCSRACQALDWKLRHKMDCAPVERW------LDDNGDGGAEGADD
         VNYCSR CQALDW+ +HK++C P++ W      + D+G+  A   DD
Subjt:  AVNYCSRACQALDWKLRHKMDCAPVERW------LDDNGDGGAEGADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATGGCGTGGCGTTATCTTGGCTGAAGGGATTAGCTTCGGATGCACAGTCTGACGCTGCTATCTGTCCACGTGTCAGAATTCGAAAGCGTTTCAAGATGAACGA
GGTGGAACTAGGGTGGCCCAAGGCAGCAGGAATCGAACAGAGGCGAAAGAACTTGGCTTGCGCAAGCGGACCGTTGGCTCGACCCTTTGGCAAGTCGAACACGGCCCTTT
CCGATGTGGGACCCCACCGATTAAATACCCATTTCCAGCAGTCTTCAGAACTGGAAAAAAATGCTTCTCTACTTCGAGAAGCAGCCACGAAGGGGACAGAGAGAAAAGAG
AGAGTCGTCGACATCAAACGGAAACAGCCACAAAACAAAGAGGCTCAAGAACCAGATTTATCCATGAGAACTCGAAGAGGCCTATGTTATCCTTCCGTTCAACTTCAAAC
CACCTTCTGTTCCGACACCCGTCTTCGCAAAAGAACTCACGATGCCGCCGCCGACCGCCTCTTTTGCCGGAAAAAGAACAAGCTCAATCCCGACATCATTCCTCCGCCGT
CCGATTTGTTCGATTCCTTGCCCGACGACCTCGTCATTTCTATTCTCTCTAAACTCAGCTCCGGCGCCTCTTGCCCCGCCGATTTCATCAACATCTTGCTAACATGCAAA
AGATTAAACTGTTTAGGGCTCAATCCAATTGTATTATCAAGAGCATCACAGAAGACGTTTGCAATTAGGGCAAGAAACTGGACGGAATCGGCTCACCGGTTTCTGAAACA
GTGTTCCGATGCCGGCAACGTCGAGGCCTGTTATACACTCGGCATGATTCGTTTCTACTGTTTGCAAAACCGAGGCAGCGGCGCTTCGCTGATGGCCAAGGCGGCGATTT
GTTCCCACGCGCCGGCGCTTTACTCACTCGCCGTCATTCAGTTCAACGGCAGCGGCGGCTCCAAAAACGACAAGGACCTTCGCGCTGGAGTTGCCCTGTGCGCACGTGCG
GCTTTCCTTGGGCATATCGACGCCCTAAGAGAACTCGGCCATTGCCTTCAAGACGGTTATGGCGTTCGCCAGAACATAACGGAGGGCCGACGGTTTCTTGTCCAGGCCAA
CGCGCGTGAACTCGCCGCCGTGCTCTCGTCGGAGTCGGCTTCTTCCGTTGTCGCATCGGGTTCGTGTCTCACTTGGAACTCTCACGCTCATCCTCACCACCGACACGTGA
CGGGCTCGGGGTGTCCATTGTTGAGCGATTTCGGTTGCAATATCCCGGCGCCGGAGACTCACCCGGCGAGTCAGTTTTTGGCGGAGTGGTTTGGGGCACGTGGGGGCTCT
CCGGGGCACGGGTTGAGGCTATGCTCGCACGTGGGATGTGGACGGCCGGAGACGAGACGGCACGAGTTCCGTCGATGTTCCGTTTGTGGTGCCGTTAACTACTGCTCACG
TGCGTGTCAGGCGTTGGATTGGAAACTCCGCCATAAAATGGATTGTGCTCCGGTAGAACGTTGGCTTGATGATAACGGTGACGGTGGGGCCGAAGGGGCCGATGATATCA
TGGCCGAAAGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAATGGCGTGGCGTTATCTTGGCTGAAGGGATTAGCTTCGGATGCACAGTCTGACGCTGCTATCTGTCCACGTGTCAGAATTCGAAAGCGTTTCAAGATGAACGA
GGTGGAACTAGGGTGGCCCAAGGCAGCAGGAATCGAACAGAGGCGAAAGAACTTGGCTTGCGCAAGCGGACCGTTGGCTCGACCCTTTGGCAAGTCGAACACGGCCCTTT
CCGATGTGGGACCCCACCGATTAAATACCCATTTCCAGCAGTCTTCAGAACTGGAAAAAAATGCTTCTCTACTTCGAGAAGCAGCCACGAAGGGGACAGAGAGAAAAGAG
AGAGTCGTCGACATCAAACGGAAACAGCCACAAAACAAAGAGGCTCAAGAACCAGATTTATCCATGAGAACTCGAAGAGGCCTATGTTATCCTTCCGTTCAACTTCAAAC
CACCTTCTGTTCCGACACCCGTCTTCGCAAAAGAACTCACGATGCCGCCGCCGACCGCCTCTTTTGCCGGAAAAAGAACAAGCTCAATCCCGACATCATTCCTCCGCCGT
CCGATTTGTTCGATTCCTTGCCCGACGACCTCGTCATTTCTATTCTCTCTAAACTCAGCTCCGGCGCCTCTTGCCCCGCCGATTTCATCAACATCTTGCTAACATGCAAA
AGATTAAACTGTTTAGGGCTCAATCCAATTGTATTATCAAGAGCATCACAGAAGACGTTTGCAATTAGGGCAAGAAACTGGACGGAATCGGCTCACCGGTTTCTGAAACA
GTGTTCCGATGCCGGCAACGTCGAGGCCTGTTATACACTCGGCATGATTCGTTTCTACTGTTTGCAAAACCGAGGCAGCGGCGCTTCGCTGATGGCCAAGGCGGCGATTT
GTTCCCACGCGCCGGCGCTTTACTCACTCGCCGTCATTCAGTTCAACGGCAGCGGCGGCTCCAAAAACGACAAGGACCTTCGCGCTGGAGTTGCCCTGTGCGCACGTGCG
GCTTTCCTTGGGCATATCGACGCCCTAAGAGAACTCGGCCATTGCCTTCAAGACGGTTATGGCGTTCGCCAGAACATAACGGAGGGCCGACGGTTTCTTGTCCAGGCCAA
CGCGCGTGAACTCGCCGCCGTGCTCTCGTCGGAGTCGGCTTCTTCCGTTGTCGCATCGGGTTCGTGTCTCACTTGGAACTCTCACGCTCATCCTCACCACCGACACGTGA
CGGGCTCGGGGTGTCCATTGTTGAGCGATTTCGGTTGCAATATCCCGGCGCCGGAGACTCACCCGGCGAGTCAGTTTTTGGCGGAGTGGTTTGGGGCACGTGGGGGCTCT
CCGGGGCACGGGTTGAGGCTATGCTCGCACGTGGGATGTGGACGGCCGGAGACGAGACGGCACGAGTTCCGTCGATGTTCCGTTTGTGGTGCCGTTAACTACTGCTCACG
TGCGTGTCAGGCGTTGGATTGGAAACTCCGCCATAAAATGGATTGTGCTCCGGTAGAACGTTGGCTTGATGATAACGGTGACGGTGGGGCCGAAGGGGCCGATGATATCA
TGGCCGAAAGTTAA
Protein sequenceShow/hide protein sequence
MENGVALSWLKGLASDAQSDAAICPRVRIRKRFKMNEVELGWPKAAGIEQRRKNLACASGPLARPFGKSNTALSDVGPHRLNTHFQQSSELEKNASLLREAATKGTERKE
RVVDIKRKQPQNKEAQEPDLSMRTRRGLCYPSVQLQTTFCSDTRLRKRTHDAAADRLFCRKKNKLNPDIIPPPSDLFDSLPDDLVISILSKLSSGASCPADFINILLTCK
RLNCLGLNPIVLSRASQKTFAIRARNWTESAHRFLKQCSDAGNVEACYTLGMIRFYCLQNRGSGASLMAKAAICSHAPALYSLAVIQFNGSGGSKNDKDLRAGVALCARA
AFLGHIDALRELGHCLQDGYGVRQNITEGRRFLVQANARELAAVLSSESASSVVASGSCLTWNSHAHPHHRHVTGSGCPLLSDFGCNIPAPETHPASQFLAEWFGARGGS
PGHGLRLCSHVGCGRPETRRHEFRRCSVCGAVNYCSRACQALDWKLRHKMDCAPVERWLDDNGDGGAEGADDIMAES