CuGenDBv2

Csor.00g046000 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene ID: Csor.00g046000
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: SURF1-like protein
Genome location: Csor_Chr18:399569..403137
RNA-Seq Expression: Csor.00g046000
Synteny: Csor.00g046000
Gene Ontology terms: GO:0005743 - mitochondrial inner membrane (cellular component)
                     GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR002994 - Surfeit locus 1/Shy1
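
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, the span covered by the gene could be derived as below. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end = parse_location("Csor_Chr18:399569..403137")
print(seqid, span_length(start, end))
```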


Homology
GenBank top hits | e value | % identity | Alignment
KAG6572973.1 Surfeit locus protein 1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 5.51e-245 | % identity: 100
Query:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
        MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
Subjt:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK

Query:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
        RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
Subjt:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE

Query:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
        QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
Subjt:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

XP_022954894.1 surfeit locus protein 1 [Cucurbita moschata] | e value: 7.50e-243 | % identity: 99.14
Query:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
        MASSSFAKSIAKFRPLI LSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEK EILDYRQK
Subjt:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK

Query:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
        RLLMEPVN+NSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
Subjt:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE

Query:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
        QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
Subjt:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

XP_022994231.1 surfeit locus protein 1 [Cucurbita maxima] | e value: 2.14e-240 | % identity: 98.56
Query:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
        MASSSFAKSIAKFRP ISL+RHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
Subjt:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK

Query:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
        RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQG E
Subjt:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE

Query:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPIT-PVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPY
        QSSD APSMVQESERSSWWKFWSKTTKNLENEVSPIT PVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPY
Subjt:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPIT-PVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPY

Query:  PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
Subjt:  PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

XP_023541123.1 surfeit locus protein 1-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 2.39e-237 | % identity: 97.7
Query:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
        MASSSFAKSIAKFRP ISLSRHSATPTPLPSSS SFSSAAAVSSAADPQSSSLSQAQQK+RDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
Subjt:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK

Query:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
        RLLMEPVN+NSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPI GLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
Subjt:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE

Query:  QSSDTAPSMVQESERSSWWKFWSKTTKNLE-NEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPY
        QSSD  PSMVQESERSSWWKFWSKTTKNLE NEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPY
Subjt:  QSSDTAPSMVQESERSSWWKFWSKTTKNLE-NEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPY

Query:  PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
Subjt:  PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

XP_023541124.1 surfeit locus protein 1-like isoform X2 [Cucurbita pepo subsp. pepo] | e value: 3.41e-239 | % identity: 97.98
Query:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
        MASSSFAKSIAKFRP ISLSRHSATPTPLPSSS SFSSAAAVSSAADPQSSSLSQAQQK+RDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
Subjt:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK

Query:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
        RLLMEPVN+NSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPI GLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
Subjt:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE

Query:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
        QSSD  PSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
Subjt:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LVW3 SURF1-like protein | e value: 2.48e-213 | % identity: 87.32
Query:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
        MASSS AKSI KFRP  SLS HS+TP  LPSSS SFSSAA VSS  DP SSSLSQ QQK+R+S+LSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYR+K
Subjt:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK

Query:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
        RLLMEPVN+N+LL L+DKLDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGW PRTWKEKALEV+ QGSE
Subjt:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE

Query:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
        QSSD  PS+VQ  ERSSWWKFWSK T++LENE++PITPVEVIGVVRTSEKPSIFVPANDPGS QWFYVDVPAIAR+SGLPED IYVEDINENVNPS+PYP
Subjt:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

A0A1S3CPJ6 SURF1-like protein | e value: 1.67e-211 | % identity: 86.17
Query:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
        MASSS AKSI KFRP  SLS HS+TP  LPSSS SFSSAA VSS  DP SSSLSQ QQK+R+S+LSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYR+K
Subjt:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK

Query:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
        RLLMEPVN+N+LL L+DKLDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMP+PGLPDSVQSPVLVNRGW PRTWKEKALEV+ QGSE
Subjt:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE

Query:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
        QSS T PS+VQE ERSSWWKFWSK T++LENE++PITPVEVIGV+RTSEKPSIFVPANDP S QWFYVDVPAIAR+SGLPED  YVEDINENVNPS+PYP
Subjt:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        IPKDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

A0A5D3BJ55 SURF1-like protein | e value: 1.67e-211 | % identity: 86.17
Query:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
        MASSS AKSI KFRP  SLS HS+TP  LPSSS SFSSAA VSS  DP SSSLSQ QQK+R+S+LSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYR+K
Subjt:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK

Query:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
        RLLMEPVN+N+LL L+DKLDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMP+PGLPDSVQSPVLVNRGW PRTWKEKALEV+ QGSE
Subjt:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE

Query:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
        QSS T PS+VQE ERSSWWKFWSK T++LENE++PITPVEVIGV+RTSEKPSIFVPANDP S QWFYVDVPAIAR+SGLPED  YVEDINENVNPS+PYP
Subjt:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        IPKDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

A0A6J1GUD0 SURF1-like protein | e value: 3.63e-243 | % identity: 99.14
Query:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
        MASSSFAKSIAKFRPLI LSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEK EILDYRQK
Subjt:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK

Query:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
        RLLMEPVN+NSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
Subjt:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE

Query:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
        QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP
Subjt:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

A0A6J1JYJ0 SURF1-like protein | e value: 1.04e-240 | % identity: 98.56
Query:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
        MASSSFAKSIAKFRP ISL+RHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK
Subjt:  MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQK

Query:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE
        RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQG E
Subjt:  RLLMEPVNVNSLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSE

Query:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPIT-PVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPY
        QSSD APSMVQESERSSWWKFWSKTTKNLENEVSPIT PVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPY
Subjt:  QSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPIT-PVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPY

Query:  PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
Subjt:  PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

SwissProt top hits | e value | % identity | Alignment
P09925 Surfeit locus protein 1 | e value: 8.4e-23 | % identity: 26.8
Query:  SSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSLLPLKD-KLDDLEFRRVICKGVFDEKKSIFVGPRS---
        SS+   A  K  D    +W L L  A  FGLGTWQ+ RR+ K++++   + R++ EP+     LP    +L +LE+R V  +G FD  K +++ PR+   
Subjt:  SSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSLLPLKD-KLDDLEFRRVICKGVFDEKKSIFVGPRS---

Query:  -------RSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSEQSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITP
                     TE+G +V+TP          +   +LVNRG+VPR                                      K       +   +  
Subjt:  -------RSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSEQSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITP

Query:  VEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIK
        V+++G+VR +E    FVP N P  + W+Y D+ A+A+ +G   D I+++    +  P    PI       +R+     +H+ Y LTWY L AA +++  +
Subjt:  VEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIK

Query:  RLRQKT
        +  ++T
Subjt:  RLRQKT

Q15526 Surfeit locus protein 1 | e value: 7.6e-24 | % identity: 29.64
Query:  SSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSLLPLKD-KLDDLEFRRVICKGVFDEKKSIFVGPRSR--
        SS+   +  K  D    +W+L L     FGLGTWQ+ RR+ K+ ++   + R+L EPV     LP    +L +LE+R V  +G FD  K +++ PR+   
Subjt:  SSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSLLPLKD-KLDDLEFRRVICKGVFDEKKSIFVGPRSR--

Query:  ---------SISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSEQSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPIT
                  IS  T++G YV+TP          +   +LVNRG+VPR  K+   E   +G                              +E E     
Subjt:  ---------SISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSEQSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPIT

Query:  PVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAI
         V++IG+VR +E    FVP N+P  + W Y D+ A+AR +G   + I+++   ++  P    PI       +R+     +HL Y +TWY LSAA +++  
Subjt:  PVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAI

Query:  KRLRQKT
        K+  + T
Subjt:  KRLRQKT

Q9LP74 Surfeit locus protein 1-like | e value: 8.3e-55 | % identity: 40.41
Query:  FRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSL
        F+ LIS S++         SS + S+  A S  ++ +S  LS A    +  + S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N+ 
Subjt:  FRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSL

Query:  LPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQG---SEQSSDTAPSM
           KD LD L FRRV+CKG+FDE++SI+VGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGWVP  WKE +LE    G   +        + 
Subjt:  LPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQG---SEQSSDTAPSM

Query:  VQESERSSWWKFWSKTTKNL--ENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNT
        +  S++S   KFW K    +  E++VS    VEV+GVVR SE P I+   N P S  WFY+DVP +A   G  ED +Y+E    +++ S  YP+P+DV  
Subjt:  VQESERSSWWKFWSKTTKNL--ENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNT

Query:  LIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTR
        L RS  +P D+  YT+ W+  S      A   L ++ T+
Subjt:  LIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTR

Q9QXU2 Surfeit locus protein 1 | e value: 2.2e-23 | % identity: 26.8
Query:  SSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSLLPLKD-KLDDLEFRRVICKGVFDEKKSIFVGPRS---
        SS+   A  K  D    +W L    A  FGLGTWQ+ RR+ K++++   + R++ EP+     LP    +L +LE+R V  +G FD  K +++ PR+   
Subjt:  SSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSLLPLKD-KLDDLEFRRVICKGVFDEKKSIFVGPRS---

Query:  -------RSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSEQSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITP
                     TE+G YV+TP          +   +LVNRG+VPR                                      K       +   +  
Subjt:  -------RSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSEQSSDTAPSMVQESERSSWWKFWSKTTKNLENEVSPITP

Query:  VEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIK
        V+++G+VR +E    FVP N+P  S W+Y D+ A+A+ +G   D I+++    +  P    PI       +R+     +H+ Y +TWY L AA +++  +
Subjt:  VEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAIK

Query:  RLRQKT
        +  ++T
Subjt:  RLRQKT

Q9SE51 Surfeit locus protein 1 | e value: 1.8e-105 | % identity: 58.81
Query:  SLSRHSATPTPLPSSSFS--FSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSLLPL
        S+S   + P    S  FS    S+++ S+A   QSSS +  Q+ +R SK S+ LLFLPGA+TFGLG+WQI RR+EK + L+Y+Q+RL MEP+ +N   PL
Subjt:  SLSRHSATPTPLPSSSFS--FSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSLLPL

Query:  KDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEK---ALEVDHQGSEQSSDTAPSMVQE
           L+ LEFRRV CKGVFDE++SI++GPRSRSISG+TENG +VITPLMPIPG  DS+QSP+LVNRGWVPR+W+EK   + E +   ++ +   +PS    
Subjt:  KDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEK---ALEVDHQGSEQSSDTAPSMVQE

Query:  SERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSS
        +E  SWWKFWSKT    +  +S + PVEV+GV+R  E PSIFVP+NDP + QWFYVDVPA+AR  GLPE+ IYVED++E+V+ S PYP+PKD+NTLIRS 
Subjt:  SERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSS

Query:  VMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        VMPQDHLNY++TWYSLSAAVTFMA KRL+ K  R+
Subjt:  VMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK

Arabidopsis top hits | e value | % identity | Alignment
AT1G48510.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein | e value: 5.9e-56 | % identity: 40.41
Query:  FRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSL
        F+ LIS S++         SS + S+  A S  ++ +S  LS A    +  + S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N+ 
Subjt:  FRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSL

Query:  LPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQG---SEQSSDTAPSM
           KD LD L FRRV+CKG+FDE++SI+VGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGWVP  WKE +LE    G   +        + 
Subjt:  LPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQG---SEQSSDTAPSM

Query:  VQESERSSWWKFWSKTTKNL--ENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNT
        +  S++S   KFW K    +  E++VS    VEV+GVVR SE P I+   N P S  WFY+DVP +A   G  ED +Y+E    +++ S  YP+P+DV  
Subjt:  VQESERSSWWKFWSKTTKNL--ENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNT

Query:  LIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTR
        L RS  +P D+  YT+ W+  S      A   L ++ T+
Subjt:  LIRSSVMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTR

AT1G48510.2 Surfeit locus 1 cytochrome c oxidase biogenesis protein | e value: 6.1e-53 | % identity: 41.88
Query:  FRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSL
        F+ LIS S++         SS + S+  A S  ++ +S  LS A    +  + S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N+ 
Subjt:  FRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSL

Query:  LPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQG---SEQSSDTAPSM
           KD LD L FRRV+CKG+FDE++SI+VGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGWVP  WKE +LE    G   +        + 
Subjt:  LPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQG---SEQSSDTAPSM

Query:  VQESERSSWWKFWSKTTKNL--ENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNT
        +  S++S   KFW K    +  E++VS    VEV+GVVR SE P I+   N P S  WFY+DVP +A   G  ED +Y+E    +++ S  YP+P+DV  
Subjt:  VQESERSSWWKFWSKTTKNL--ENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNT

Query:  LIRSSVMP
        L RS+ +P
Subjt:  LIRSSVMP

AT3G17910.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein | e value: 1.2e-106 | % identity: 58.81
Query:  SLSRHSATPTPLPSSSFS--FSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSLLPL
        S+S   + P    S  FS    S+++ S+A   QSSS +  Q+ +R SK S+ LLFLPGA+TFGLG+WQI RR+EK + L+Y+Q+RL MEP+ +N   PL
Subjt:  SLSRHSATPTPLPSSSFS--FSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVNSLLPL

Query:  KDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEK---ALEVDHQGSEQSSDTAPSMVQE
           L+ LEFRRV CKGVFDE++SI++GPRSRSISG+TENG +VITPLMPIPG  DS+QSP+LVNRGWVPR+W+EK   + E +   ++ +   +PS    
Subjt:  KDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEK---ALEVDHQGSEQSSDTAPSMVQE

Query:  SERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSS
        +E  SWWKFWSKT    +  +S + PVEV+GV+R  E PSIFVP+NDP + QWFYVDVPA+AR  GLPE+ IYVED++E+V+ S PYP+PKD+NTLIRS 
Subjt:  SERSSWWKFWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSS

Query:  VMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
        VMPQDHLNY++TWYSLSAAVTFMA KRL+ K  R+
Subjt:  VMPQDHLNYTLTWYSLSAAVTFMAIKRLRQKTTRK
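
In the alignment blocks above, the unlabeled middle line follows the usual BLAST midline convention: the residue letter marks an identical position, `+` a conservative substitution, and a space a mismatch or gap. The following is a rough sketch of how a percent identity could be recomputed from a Query line and its midline, assuming identity is the number of identical columns divided by the alignment length. The page does not say how its % identity column was derived, and `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline repeats the query residue."""
    # Trailing spaces are often lost in copied text, so pad to the query length.
    midline = midline.ljust(len(query))
    if not query or len(midline) != len(query):
        raise ValueError("query and midline must cover the same columns")
    identical = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    return 100.0 * identical / len(query)


# First 13 columns of the second block of the Cucurbita moschata hit:
# 12 identical columns and one '+' column.
print(round(percent_identity("RLLMEPVNVNSLL", "RLLMEPVN+NSLL"), 2))
```

To reproduce a whole-hit value, the Query and midline segments of every block for that hit would need to be concatenated first, with the `Query:  ` prefix and the matching midline indentation removed.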


Sequences
CDS sequence
ATGGCATCTTCTTCCTTTGCTAAATCCATAGCTAAATTTCGCCCTTTGATTTCCCTTTCTCGGCACAGCGCGACGCCGACGCCTTTACCTTCGTCTTCTTTCTCGTTCAG
CTCTGCGGCTGCAGTTTCATCTGCTGCTGATCCCCAGTCATCTTCGCTTTCTCAAGCTCAACAGAAACGAAGAGATTCGAAATTGTCAAAATGGCTCCTGTTTTTACCTG
GGGCTCTCACGTTTGGCCTTGGAACTTGGCAGATTTTCAGGAGGCAAGAGAAGATAGAAATTCTAGATTACAGGCAGAAGCGACTGTTAATGGAACCTGTGAACGTAAAC
AGCTTACTTCCATTGAAAGACAAGCTAGATGATCTGGAGTTCAGGAGGGTGATCTGTAAAGGAGTTTTTGATGAGAAGAAATCAATCTTCGTTGGTCCACGTTCGAGAAG
CATTTCGGGAGTGACCGAAAATGGCCATTATGTTATTACCCCATTGATGCCCATTCCTGGCCTACCTGATAGTGTGCAGTCACCTGTTTTGGTCAATCGAGGATGGGTTC
CACGCACTTGGAAGGAGAAGGCTTTAGAAGTAGACCATCAAGGTAGTGAACAGTCTTCAGATACTGCACCCTCCATGGTTCAAGAGAGTGAGAGAAGCTCGTGGTGGAAG
TTCTGGTCGAAAACGACCAAGAATCTAGAGAATGAAGTGAGCCCCATAACTCCAGTAGAAGTTATTGGAGTGGTTCGCACAAGTGAAAAGCCTAGCATATTTGTTCCAGC
AAATGATCCTGGCTCTAGCCAATGGTTTTATGTTGATGTTCCAGCAATTGCACGTACTTCTGGACTCCCTGAGGATGCTATTTATGTGGAAGACATAAATGAGAATGTGA
ACCCAAGTAACCCGTATCCGATTCCGAAGGATGTTAATACGTTGATACGAAGTTCCGTTATGCCACAGGATCATCTTAATTACACATTAACATGGTACTCTCTCTCAGCT
GCTGTGACCTTCATGGCAATCAAAAGACTGAGGCAAAAAACGACTCGGAAATGA
mRNA sequence
ATGGCATCTTCTTCCTTTGCTAAATCCATAGCTAAATTTCGCCCTTTGATTTCCCTTTCTCGGCACAGCGCGACGCCGACGCCTTTACCTTCGTCTTCTTTCTCGTTCAG
CTCTGCGGCTGCAGTTTCATCTGCTGCTGATCCCCAGTCATCTTCGCTTTCTCAAGCTCAACAGAAACGAAGAGATTCGAAATTGTCAAAATGGCTCCTGTTTTTACCTG
GGGCTCTCACGTTTGGCCTTGGAACTTGGCAGATTTTCAGGAGGCAAGAGAAGATAGAAATTCTAGATTACAGGCAGAAGCGACTGTTAATGGAACCTGTGAACGTAAAC
AGCTTACTTCCATTGAAAGACAAGCTAGATGATCTGGAGTTCAGGAGGGTGATCTGTAAAGGAGTTTTTGATGAGAAGAAATCAATCTTCGTTGGTCCACGTTCGAGAAG
CATTTCGGGAGTGACCGAAAATGGCCATTATGTTATTACCCCATTGATGCCCATTCCTGGCCTACCTGATAGTGTGCAGTCACCTGTTTTGGTCAATCGAGGATGGGTTC
CACGCACTTGGAAGGAGAAGGCTTTAGAAGTAGACCATCAAGGTAGTGAACAGTCTTCAGATACTGCACCCTCCATGGTTCAAGAGAGTGAGAGAAGCTCGTGGTGGAAG
TTCTGGTCGAAAACGACCAAGAATCTAGAGAATGAAGTGAGCCCCATAACTCCAGTAGAAGTTATTGGAGTGGTTCGCACAAGTGAAAAGCCTAGCATATTTGTTCCAGC
AAATGATCCTGGCTCTAGCCAATGGTTTTATGTTGATGTTCCAGCAATTGCACGTACTTCTGGACTCCCTGAGGATGCTATTTATGTGGAAGACATAAATGAGAATGTGA
ACCCAAGTAACCCGTATCCGATTCCGAAGGATGTTAATACGTTGATACGAAGTTCCGTTATGCCACAGGATCATCTTAATTACACATTAACATGGTACTCTCTCTCAGCT
GCTGTGACCTTCATGGCAATCAAAAGACTGAGGCAAAAAACGACTCGGAAATGA
Protein sequence
MASSSFAKSIAKFRPLISLSRHSATPTPLPSSSFSFSSAAAVSSAADPQSSSLSQAQQKRRDSKLSKWLLFLPGALTFGLGTWQIFRRQEKIEILDYRQKRLLMEPVNVN
SLLPLKDKLDDLEFRRVICKGVFDEKKSIFVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWVPRTWKEKALEVDHQGSEQSSDTAPSMVQESERSSWWK
FWSKTTKNLENEVSPITPVEVIGVVRTSEKPSIFVPANDPGSSQWFYVDVPAIARTSGLPEDAIYVEDINENVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSA
AVTFMAIKRLRQKTTRK
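
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is a minimal illustration, assuming the standard nuclear genetic code and an in-frame sequence made only of A, C, G and T; `translate` and `CODON_TABLE` are hypothetical names introduced here, not something provided by the database.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # A codon with any other letter (for example N) raises KeyError.
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGCATCTTCTTCC"))  # first 15 nt of the CDS -> MASSS
print(translate("ACGACTCGGAAATGA"))  # last 15 nt of the CDS -> TTRK
```

Passing the full CDS text, line breaks included, should yield a string that can be compared directly with the protein sequence listed above.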